Gaiku: Generating Haiku with Word Associations Norms
Abstract
creativeness / a pleasing field / of bloom

Word associations are an important element of linguistic creativity. Traditional lexical knowledge bases such as WordNet formalize a limited set of systematic relations among words, such as synonymy, polysemy and hypernymy. Such relations maintain their systematicity when composed into lexical chains. We claim that such relations cannot explain the type of lexical associations common in poetic text. In this paper we explore the use of Word Association Norms (WANs) as an alternative lexical knowledge source for analyzing computational linguistic creativity. We specifically investigate the Haiku poetic genre, which is characterized by heavy reliance on lexical associations. We first compare the density of WAN-based word associations in a corpus of English Haiku poems to that of WordNet-based associations, and to the density of both in other, non-poetic genres. These experiments confirm our hypothesis that the non-systematic lexical associations captured in WANs play an important role in poetic text. We then present Gaiku, a system that automatically generates Haikus from a seed word using WAN-based associations. Human evaluation indicates that generated Haikus are of lesser quality than human Haikus, but a high proportion of generated Haikus can confuse human readers, and a few of them trigger intriguing reactions.

∗ Supported by Deutsche Telekom Laboratories at Ben-Gurion University of the Negev.
† Supported by the Lynn and William Frankel Center for Computer Sciences.
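As a rough illustration of what "density of word associations" could mean, the sketch below counts the fraction of word pairs in a short text that are linked in an association table. This is a minimal sketch under stated assumptions, not the measure used in the paper: the table `TOY_WAN` is invented toy data, and the names `associated` and `association_density` are hypothetical.

```python
from itertools import combinations

# Toy word association norms: cue -> set of responses.
# Hypothetical data for illustration, not taken from any published norms.
TOY_WAN = {
    "field": {"bloom", "grass", "flower"},
    "bloom": {"flower", "spring"},
    "pleasing": {"nice"},
}


def associated(a, b, wan):
    """Return True if either word appears as a response to the other."""
    return b in wan.get(a, set()) or a in wan.get(b, set())


def association_density(words, wan):
    """Fraction of unordered pairs of distinct words that are associated."""
    unique = sorted(set(words))
    pairs = list(combinations(unique, 2))
    if not pairs:
        return 0.0
    linked = sum(1 for a, b in pairs if associated(a, b, wan))
    return linked / len(pairs)


# Content words of the epigraph haiku from the abstract.
poem = ["creativeness", "pleasing", "field", "bloom"]
density = association_density(poem, TOY_WAN)
print(density)
```

Under this toy table only the pair (field, bloom) is linked, so one of the six possible pairs counts toward the density. Swapping in a WordNet-derived table for the same function would give a comparable figure for systematic relations.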
Similar resources
Haiku Generator that Reads Blogs and Illustrates Them with Sounds and Images
In this paper we introduce our haiku generator, which, in contrast to other systems, is not restricted to limited classic vocabulary sets and preserves a classic style without becoming too random and abstract because it performs a semantic integrity check using the Internet. Moreover, it is able to analyze blog entry input and, by using nouns and adjectives for web-mining, to stay on topic and ...
Strong Associations Can Be Weak: Some Thoughts on Cross-lingual Word Webs for Translation
This paper discusses the implications of human word association norms on the modelling of word associations from large corpora and the relevance of different types of associations in the process of translation, with a focus on adjectives. It is observed that the proportion of paradigmatic responses found in English norms tends to be higher, whereas a clear preference for syntagmatic association...
Verb Disambiguation through the Fusion of Two Independent Systems
This paper presents the fusion of two independent systems, namely TSGR (Target Sequence Generation by Refinement) and D&H (DIPETT & HAIKU). The former is designed for translation purposes and can quickly compute the most probable meaning, in the target language, of multi-sense verbs appearing in the same paragraph. The latter is a text analysis system which performs syntactic and case-based seman...
Inferring a Probabilistic Model of Semantic Memory from Word Association Norms
In this paper, we introduce a method of data analysis for word association norms. The defining characteristic of this method is that it is based upon the inference of a probabilistic generative model of word associations. The inferred model can in principle provide a clear and intuitive representation of the semantic knowledge inherent in word association data, facilitate an understanding of the p...
Psychological norms for simple three-line graphic shapes.
There is a longstanding tradition in psychological research for norming lists of words that are used in experimental studies. The present study extends this practice to graphic imagery by obtaining norming data on 24 simple abstract graphic shapes composed of three straight-line segments. The attributes obtained in the norming procedure were the shapes' familiarity, describability, associabilit...